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A cable television (CTV) system having an ad-insertion 
apparatus for automatically inserting commercial segments 
into program material under the control of cue tones trans- 
mitted by the program source. The system includes appara- 
tus for normalizing the audio signal levels of the program 
and commercial materials so that the audio portion of the 
output signal being transmitted to subscribers will have a 
relatively uniform loudness. The same concept may be 
applied to video signals. Additionally, signals coming from 
several channels may be normalized with respect to each 
other using the same technique. One aspect involves nor- 
malization of the audio level of the commercial, based on 
measured levels of the program audio preceding the adver- 
tisement In other variations, the program audio level is 
adjusted to match a preset audio level of an advertisement 
In another aspect of the invention, the audio level adjustment 
is achieved by monitoring the deviation of an audio modu- 
lator. In general the technique comprises generating com- 
posite CTV output signals in each of a plurality of CTV 
channels by generating a series of program segments and 
cue tones indicating the borders of the program segments 
and a series of commercial segments in response to the cue 
tones. Each CTV channel output is formed by alternately 
linking program segments with commercial segments at the 
borders in response to the cue tones. The channel outputs are 
combined for simultaneous transmission to subscribers. The 
loudness of the segments in one of the CTV channels is 
monitored. Volume attenuators are adjusted in each of the 
channels as a function of the loudness in one of the channels 
such that the loudness of the audio in the channels is 
normalized. 

38 Clafons, 12 Drawing Sheets 
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METHOD AND APPARATUS FOR 
NORMALIZING SIGNAL LEVELS IN A 
SIGNAL PROCESSING SYSTEM 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The invention relates to signal processing systems and 
methods and. particularly, to techniques for normalizing 
signal levels in a signal processing system. 

2. Description of the Prior Art 

Many electronic systems configure and format signals by 
linking together a series of signal segments obtained from a 
number of different sources. In such systems, it is usually 
important that the signal levels at the different sources be 
matched. Conventional television and radio transmission 
systems are notable examples. 

For example, subscriber television systems, such as cable 
television (CTV). normally deliver programs that are formed 
from a number of successive segments that originate at 
different sources. Many cable channel programmers set 
aside approximately four minutes in two blocks each hour 
for local advertising insertion. These advertisement blocks 
are sold by the local cable operator or by an advertising 
consortium of several cable systems. The cable operator 
automatically inserts the advertising in. for example, a 
satellite-delivered program coming from the programmer. 
The insertion is usually done locally under the control of cue 
tones transmitted by the programmer. At these specific cues, 
the cable operator switches different audio and/or video 
programming. Consequently, cable operators frequently 
encounter the problem of matching the audio and/or video 
levels between the different sources. This problem is par- 
ticularly acute in CTV systems where the system performs 
automatic switching with no human operator to adjust 
levels. 

In many prior art CTV systems, ad-insertion is handled by 
a combination of cue tone detectors, switching equipment 
and tape players which hold the advertising material. Upon 
receipt of the cue tones, a CTV insertion controller auto- 
matically turns on a tape player containing the advertise- 
ment Switching equipment then switches the system output 
from the video and audio signals received from the pro- 
gramming source to the output of the tape player. The tape 
player remains on for the duration of the advertising, after 
which the insertion controller causes the switching equip- 
ment to switch back to the video and audio channels of the 
programming source. When switched, these successive pro- 
gram and advertising segments usually feed to a radio- 
frequency (RF) modulator for delivery to the subscribers. 

Many subscriber television systems, such as CTV 
systems, are currently being converted to digital equipment 
In the future, video file-server systems will replace many of 
the conventional tape players. These new digital systems 
compress the advertising data, e.g.. using Motion Picture 
Experts Group 2 (MPEG2) compression, store the com- 
pressed data as a digital file on a large disk drive (or several 
drives), and then, upon receipt of the cue tone, spool 
('•play") the file off the drive to a decompressor. The video 
and accompanying audio data are decompressed back to 
standard video and audio, and switched into the video/audio 
feed of the RF modulator for delivery to the subscribers. 

One of the most critical problems confronting designers 
of CTV systems and other similar transmission systems, has 
been normalizing the audio and video levels between the 
programming and the advertising. It is generally known that 
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many subscribers have complained for years that the audio 
sounds higher during commercials than during program- 
ming. Although the audio during commercials can also 
sound low compared to the program level, few people 

5 complain in mat case. 

Consequently, those concerned with the development of 
radio, television, data, control and equivalent transmission 
systems have recognized the need for more effective signal- 
level normalization techniques. The present invention ful- 

io fills this need. 

SUMMARY OF THE INVENTION 

Therefore, it is an object of the invention to provide an 
improved signal-level normalization technique fox use in 
15 signal transmissions such as audio, video and data transmis- 
sions. 

It is another object of the invention to provide a signal- 
level normalization technique particularly suitable for use in 
digital networks. 

20 A further object of the invention is the provision of 
systems and methods of normalizing the signal level in a first 
signal block, such as the audio and/or video levels in a 
television program or a commercial, based on comparable 
measured levels of a second signal block, such as a prior 

25 program or a preceding commerciaL 

Still another object of the invention is the provision of 
systems and methods of normalizing the signal level in a 
signal block based on previously measured levels of the 

30 signal block. 

Yet a further object of the present invention is the provi- 
sion of a signal insertion technique capable of performing 
quality checks, such as verifying that the proper commercial 
was inserted into a predetermined location in the advertising 

35 block of a program. 

According to the invention, a signal processing system 
having a normalized output signal comprises a first signal 
source and a second signal source. A signal combiner 
connects to the first and second signal sources for forming 

40 an output signal by linking signal segments derived from the 
first and second signal sources into a series of the signal 
segments. A level processor connects to the signal combiner 
for deternuning a level of intensity of the output signal. A 
level adjuster connects to at least one of the signal sources 

45 and responds to the level processor for adjusting a level of 
intensity of the signal segments at least one of the signal 
sources such that the level of intensity of the output signal 
is normalized. 

More specifically, the invention provides a signal trans- 

so mission system fox producing a composite output signal 
formed by linking signals from a plurality of signal sources 
comprising a first signal source for generating a series of first 
segments and cue tones indicating the borders of the first 
signal segments, and a second signal source connected to the 

55 first signal source. Responsive to the cue tones, the second 
signal source generates a series of second signal segments. 
The first and second signal segments include audio signals. 
A signal combiner is connected to the first and second signal 
sources for forming the composite output signal by alter- 

60 nately linking the first signal segments with the second 
signal segments. A level processor connects to the signal 
combiner for determining the loudness of the audio portion 
of the composite output signal. A level adjuster connects to 
at least one of the signal sources and responds to the level 

65 processor for adjusting the loudness of signal segments from 
the signal sources such that the volume of the output signal 
is normalized. 
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Still another aspect of Che invention involves a signal of FIG. 13 with a conventional A-weighting curve which 
transmission method for transmitting an output signal by closely resembles the frequency response of a typical human 
generating composite output signals in each of a plurality of car- 
signal channels. The composite output signals in each chan- DETAILED DESCRIPTION OF THE 
nel is formed by generating a series of first signal segments 5 PREFERRED EMBODIMENTS 
and cue tones indicating the borders of the first signal Referring no w to the drawings, FIG. 1 shows a general 
segments, and generating a series of second signal segments ^ of ctv system 20. It is to be understood that 
in response to the cue tones. Each channel output is formed ^ utficulBS CTV systems described herein are exemplary 
by alternately Unking the first signal segments with the an ^ at me invention has application in other equivalent 
second signal segments at the borders in response to the cues 10 ^ q ^ ^ othef equivalent transmission sys- 
tones. The channel outputs from each channel are combined tems ^ system 20 deludes earth station receiver 21 
for simultaneous transmission to users. The method includes provides decoded program material, both video and 
foestepofdetemiiiungalev^^ audio \ { {ts ^ 22. Earth station receiver 21 
channel output and adjusting a level of intensity of each of comp rises conventional receiver and decoder equipment. A 
the channel outputs as a function of the intensity in the first is receiver 21 nonna u y outputs analog video (e.g.. in 
channel output such that the levels of intensity of the channel £ National Television System Committee (NTSC) format), 
outputs are normalized plus 0QC to fhsec channels of audio. The audio channels are 

These and other objects, features and aspects of the ra ost commonly left- and right-channel stereo and possibly 
invention will be more clearly understood and better a monaura i channel for a second audio program (SAP), 
described if the following detailed description is read in 20 often m another language. Many cable operators contem- 
conjunction with the appended drawings wherein: pla|e replacing these analog systems with more modern 

BRIEF DESCRIPTION OF THE DRAWINGS CTV systems that output digital video and/or audio 

signals from receiver 21. 

FIG. 1 is a system block diagram of a cable television ^ to mc video md audio channels, receiver 21 

head-end system constructed in accordance with a preferred 25 tfansmits cue tones oq line 23 to ad-insertion system 24 
embodiment of the invention. wnicn usc$ me m t0 generate tuning signals for 

FIG. 2 is a graph of loudness of an audio signal on a controlling the insertion of advertising into the program 
logarithmic scale vs. time, which is useful in understanding material In most systems, the cue tones are a series of 
the preferred embodiment of FIG. 1. dual-tone multiple-frequency (DTMF) tones which identify 

FIG. 3 is a detailed block diagram of a cable television the programmer and the insertion times in the ad-insertion 
head-end system constructed in accordance with the pre- process. Typically, different cues are transmitted fox pre-roll 
f erred embodiment of FIG. 1. (an advance in time to allow a tape machine to get up to 

FIG. 4 is a graph, similar to that of FIG. 2, showing speed), transfer-to-ad (a time at the beginning of the adver- 
loudness of an audio signal on a logarithmic scale vs. time, 35 tisement block) and return (the conclusion of the advertise- 
which is useful in understanding the preferred embodiment ment block). 

of FIGS. 1 and 3. Timing signals, which ad-insertion system 24 outputs 

FIG. 5A is a flow chart illustrating process steps per- onto lines 34, operate transmission switch 25. Ad-insertion 
formed by the preferred embodiment of FIGS. 1 and 3. system 24 generates these timing signals in response to the 

FIG. SB is a flow chart, similar to that of FIG. 5A. 40 cue tones received on line 23. When operated, switch 25 
illustrating alternate process steps performed by the pre- routes either the program audio and video, which appear on 
fenedenLiimentof lTOS. l andT lines 22, or the advertising video and audio, which appear on 

IS. J^a^ed block diagram, similar to that of FIG. ^ *>> to mc4ulator 30 via lines 30. 
3, tfan a?te^^ ofme invention. t*cs*t video and audio ^J^^^^ 

. ♦ . ^fTTrn a 45 to subscribers along with all other channels via combiner 28. 

FIG. 7 is a graph, similar to that of FIG. 4. showing « Uq ^ x<s ^ ^ often include other equipment that 
loudness of an audio signal on a loganthmic scale vs. dme, rcceiycs ^ processes the signals, such as scramblers and 
which is useful in understanding the embodiment of FIG. stcfeo cnco ^ & ufo. 

FIG. 8 is a flow chart illustrating process steps performed Consequently, in response to appropriate cue tones on line 
by the embodiment of FIG. 6. 50 23, ad-insertion system 24 automatically inserts commer- 

FIG. 9 is a system block diagram, similar to mat of FIG. c ^ s mto advertising blocks by spooling advertising material 
1. of a cable television system constructed in accordance onto u Des 27 and, hence, modulator 30 via switch 25 and 
with another alternate embodiment of the invention. 33 j 0 provide normalization between the audio and/or 

FIG. 10 is a detailed block diagram, similar to mat of FIG. video levels of the commercials with that of the program, 
6 ( of the alternate embodiment of the invention shown in 5J i cvc j processor circuit 26 monitors the signals being 
FIG. 0. switched onto lines 33. Level processor circuit 25 adjusts. 

FIG. 11 is a detailed block diagram of a commercial v ia lines 31, the appropriate signal levels of the advertising 
verification system in accordance with the present invention. material being outputted by ad-insertion system 241. 

FIG. 12 is a flow chart illustrating process steps per- As described above, a primary problem addressed by mis 
formed by the system of FIG. 11. 60 invention involves correcting a possible mismatch between 

FIG. 13 is a detailed circuit schematic with parameter the audio level coming from receiver 21 and that coming 
values for the various elements thereof, representing a from ad-insertion system 24. Each apparatus represents a 
particular implementation of a level detector that forms a source of audio that is prepared at a different location by 
part of the various preferred emtodiments of the present different people using different equipment While efforts 
invention. 65 have been made in the past to normalize the audio levels 

FIG. 14 is a graph showing a plot of response in decibels from these different sources, limited success has been 
(dB) vs. frequency for comparing test results for the circuit achieved. 
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Typically, a level adjustment device resides within 
ad-insertion system 24 to allow operating personnel to 
manually or electronically correct for gain variations in the 
equipment. These level adjustment devices are often a 
source of possible errors due to frequent miss-adjustments. 
Still further, the gain of various pieces of system equipment 
will often shift with time and temperature. 

Yet another problem in the CTV industry is that the level 
coming from receiver 21 may be inconsistent from one 
channel to another. While most receivers and decoders used 
in the industry have fixed output levels for a fixed input at 
the uplink, different programmers operate uplinks with dif- 
fering input levels. This is partially because different types 
of program material require different amounts of headroom, 
and different uplink engineers have different philosophies 
about the amount of headroom to allow. Because of this, 
level adjustments in ad-insertion system 24 have become 
necessary. 

FIG. 2. which plots loudness vs. time, illustrates the 
problem graphically. This graph shows the relative audio 
levels of four consecutive segments consisting of two pro- 
gram segments PI and P2. and an advertising block with two 
commercials CI and C2 located therebetween. Program 
segments PI and P2 and commercials CI and C2 have their 
audio levels varying over time. The first program segment 
PI extends between times tl and t3. An advertising block, 
starting with commercial CI. follows program segment PL 
The graph shows commexcial CI having a higher loudness 
than did the preceding program segment PI. Next in the 
advertising block is second commercial C2. Because it was 
recorded differently than was first commercial CI, the graph 
shows it playing louder than commercial CI. The successive 
commercials CI and C2 extend between times t3 and t4. 
Finally, second program segment P2 follows second com- 
mercial C2. and plays at the lower loudness level of program 
segment PI. 

The variations in loudness illustrated in FIG. 2 usually 
produce the irritating phenomenon sometimes called * 'blast- 
ing" out the commercial. As mentioned above, broadcasters 
and cable operators have been accused of doing this inten- 
tionally to gain listener attention for the commercials. 
However, usually the source of the problem is more mun- 
dane: either the commercial automatically plays and no one 
is available to correct the audio level, or metering problems 
make an objective comparison of the levels difficult 

FIG. 2 also shows the transmission times of the cue tones. 
The pre -roll cue tone arrives at time t2. which is usually 
about five to eight seconds before the commercial starts at 
time t3. to allow tape equipment to get up to speed before 
being played at time t3. A transfer cue tone arrives at time 
G to insert first and second commercials CI and C2 into the 
advertising block. Finally, a return cue tone arrives at the end 
of the commercial break, time t4, to instruct the system to 
return to the program material supplied from receiver 21. 

Referring now to FIGS. 1-4. a normalization technique in 
which the advertisement audio level is adjusted to match the 
level of the program audio will be described in detail. FIG. 
3 depicts receiver 21 as having two channels, namely 
program video source 21V and program audio source 21A. 
Modulator 20 includes video modulator 30V and stereo 
encoder modulator 30 A. FIG. 3 shows ad-insertion system 
24 of FIG. 1 as having audio and video channels designated 
as advertisement video source 24V and advertisement audio 
source 24 A. 

FIG. 3 schematically depicts switch 25 as comprising a 
plurality of ganged switches, namely video switch 51 and 
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audio switches 52. 53 and 54. being suitably switched by 
signals on lines 34. One input side of video switch 51 
connects to program video source 21V. The other input side 
of video switch 51 connects to advertisement video source 

5 24V. The output side of video switch 51 connects to video 
modulator 30V. 

One input side of each of audio switches 52-54 connects 
to respective left right and SAP channels of program audio 
source 21 A. The other input sides of audio switches 52-54 

w connect to the respective left right and SAP channels of 
advertisement audio source 24A via respective attenuators 
55, 56 and 57 which function as audio level controls. The 
output sides of audio switches 52-54 connect to stereo 
encoder modulator 3® A. 

15 The output sides of audio switches 52 and 53 also connect 
to adder 39 of level processor circuit 26 via lines 33. 
Because the preferred manner of evaluating the level of a 
stereo signal is to evaluate the sum of its left and right 
channels, adder 39 sums these stereo signals. Level detector 

20 40 receives the summed output of adder 39 and inputs the 
inverting (-) side of comparison amplifier 32. Level detector 
40 receives the audio signals at its input, and outputs a 
voltage corresponding to the loudness. The setting of switch 
25 will determine which audio signals adder 39 sums. With 

25 respect to the graph of FIG. 2, the output of level detector 40 
will be a voltage that varies, preferably as a decibel (dB) 
function, in accordance with the four curves representing the 
loudness of program segments PI and P2 and commercials 
CI and C2. 

30 

Audio reference level circuit 41, represented by a Zener- 
dtode voltage source, provides a fixed reference level LI at 
the (+) input to comparison amplifier 32. Thus, comparison 
amplifier 32 outputs the difference between the actual vol- 

35 ume level of the audio source, as detected by level detector 
40, and the reference volume level LI which has been preset 
in reference level circuit 41. Analog-to-digital (A/D) con- 
verter 44 samples the output of comparison amplifier 32. 
Consequently. A/D converter 44 outputs these sampled 

^ differences to logic 42 as a series of digital words propor- 
tional to a dB audio level Logic 42 communicates with a 
memory 43. As will be described below in detail, logic 42. 
which monitors the output of A/D converter 44. adjusts 
attenuators 55-^57 via attenuator control lines 31 in accor- 

45 dance with error values calculated by logic 42. To perform 
its level adjusting functions, logic 42 derives control and 
timing signals from insertion controller 46, which is a 
conventional part of ad-insertion system 24. 
In response to the cue tones transmitted from program 

so video source 21V on line 23. insertion controller 46 operates 
transmission switch 25 and sends control signals to logic 42. 
video source 24V and audio source 24A. Further, insertion 
controller 46 secures appropriate synchronization (sync) 
pulses and timing signals from sync separator and timing 

55 circuit 47, which is also a part of conventional ad-insertion 
system 24. Advertisement video source 24V uses the output 
of circuit 47 to synchronize its video with program video 
source 21V. Additionally, insertion controller 46 acquires 
appropriate timing signals from timing circuit 47 so that it 

50 can initiate spooling of the commercial when the appropriate 
pre-roll cue tone on line 23 is received, for example, at time 
t2 of FIG. 2. Still further, insertion controller 46 operates 
transmission switch 25. via switch control lines 34. in 
accordance with the appropriate cue tones transmitted on 

65 line 23. 

It is noted that the SAP channel, if supplied, is controlled 
separately, based on its own characteristics. However, in the 
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interest of darity. this description and the related drawings In calculate STEP 76. logic 42 calculates audio level 

omit the details of the SAP controls. However, it will be errors Ed based on the current value of the target level and 

readily understood by those skilled in these arts that the SAP the output sample at the start of a commercial. For example, 

channel, and any other audio or data channels, may be immediately after insertion controller 46 transmits transfer 

treated in a similar manner as are the left and/or right stereo 5 trigger, i.e., at time t3+. logic 42 calculates error El as the 

channels described herein. difference between the loudness level at the start of com- 

The operation of CTV system 20 will now be described mercial CI and the target level. Logic 42 next performs 

with reference to the circuit of FIG. 3. the sample curves adjust STEP 77 by applying the current audio level error El. 

depicted in FIG. 4 and the FIG. 5A flow chart As seen in yj a 1^ 31, to adjust the preset attenuators 5S and 55 in a 

FIG. 4. the audio level over any length of time is not , 0 direction that will cause the audio level of commercial CI to 

constant; typical program audio may range from a whisper move ^ to ^ targa level. 

to the loud screech of an airplane toMng off. Thus, it is ^ reference level LI represents the expected 

necessary to be precise as to what is meant by the level of ^ ^ a&mtitaaati ^ nolma Uy obtained by 

a program. For the present description, one preferred strat- reconlill (compre ssing) all commercials at a nominally 

egy is to match the last few seconds of me program audio „ ld e Dnca l audio level to level U . FIG. 4 shows the 

level (right end of program segment PI) with the first few ay difference D and the audio level error El to be 

seconds of the following commercial (left end of commer- ^^tiaUy cquaL which illustrates the typical situation 

cial CI). Logic 42 performs this function by keeping a whgre cl was actually recorded at the nominal 

running log in memory 43 of "level measurements based on or expected leve l LI. On the other hand, FIG. 4 also shows 

an average of the last portion of the program material. At the M ronU jj erc j a j C2 as having been recorded slightly higher than 

time the program plays, this average becomes the larget nominal OT ^ Ieve i u . 

level" to which the corr^c^ audio leve i s etched ^ ^ rf P1 just 

J^JT? a * f '? a l ^SS'S prior to commercial time transmits louder than the expected 

42 ^ <X ^°,^ %^r«, T ^m?n^ or nominal level LI. the level of ooinmerdal Cl will be 

monitor STEP 70 (see FIG. SA) andcontinues to do so until „ og attenuators SS and 56. On the other 

the system is turned off. Duxujg te.V^«1W«* handjftoe audio level of program segment PI is lower than 

segment PI, transmission switch 25 is in its normally up ... , , „ iLS-, in Sro & t^lc 42 adiusts 

and right channels of program audio source 21 A for program ^ ™* ,„„„, . „„, 

segment PI. Consequently, the output of comparison ampli- 30 doWD w levcL 

ta3B iS vary with the dB difference between program Ideally mis would be the end of the process. However, as 

segment PI and fixed reference level LI. a practical matter. U is ^^^J^J^J^ 

Upon receiving the pre-roll cue tone at time t2. insertion commerces wdl be recorded ar Ate tgbt leva Tie 

controller 46 transmits a pre-roll trigger to logic 42 in trigger commercials will be recorded at differen tomes by different 

£l-P 71. thereby erasing to storein mcnTory 43 a series 35 "^viduals using different j£^>^™"*J 

of output values from A/D converter 44 as indicated by store different aucUo processing ; system* juri a^y^ckfferent 

STEP 71. Logic 42 next receives, in trigger STEP 73. a ideas of how to interpret the audtoleveL Thus, tte level of 

transfer trigge? from controUer 46 at time & At this point. comineraab will usuaUy not be the same Tte is 

logic 42 uses the series of previously stored values, in floated "> 4 by me difference between the levels of 

calculate STEP 74. to calculate and store the average output 40 corameraals Cl and C2. ^ 

of A/D converter 44 between times t2 and t3. This average Logic 42 continues to monitor the output of controller 46, 

represents the average difference D between program seg- waiting for the next trigger signal. In the absence of a return 

ment PI and reference level LI for the period t2-t3 (see the cue tone, insertion controller 46 signals advertisement video 

FIG. 4 graph). Using mis average difference D and reference source 24V and audio source 24A to start playing commer- 

levcl LI. in calculate and store STEP 75. logic 42 calculates 45 cial C2 at the end of commercial Cl. At this time, controller 

a target level and stores that value in memory 43. 46 transmits a next-commercial trigger to logic 42 which 

U is noted that many conventional ad-insertion equipment exits the NO path of decision STEP 78 and proceeds to 
know within a minute or two when a commereial is to be trigger STEP 79. In response to receiving the next- 
inserted, ic. when to expect to receive a pre-roll cue tone. commercial trigger, logic 42 increments index jjjn index 
Because of this, the program segments may be monitored » STEP ^.^tr* process 1 returns to catatlate STEP 74 In 
and averaged over a time period other than period t2-t3. calculate STEP 76. logic 42 calculates the new radio level 
which usually lasts Only five to eight seconds. For example. error E2. which represents the smaU additional amount that 
the average difference D may beltetermined over a one or attenuators 55 and 56 must be adjusted, in adjust STEP 77. 
two minute period just prior to the advertisement block. to move the level of commercial C2 down to the target leveL 

As discussed above, at time t2 insertion controUer 46 55 When controller 46 receives a return cue from program 

triggers advertisement video source 24V and audio source video source 21V attimet4.it switches transition switch 

24A to initiate the advertising process. Le.. to turn on the 25 back to the normally up position of FIG. 3. thereby 

tape player to get it up to speed or. in the case of a digital disconnecting advertisement video source 24V. and I audio 

system, to retrieve digital data from the appropriate file source 24A and reconnecting program video source 21V and 

servers. Further upon receiving the transfer cue tone at time «o audio source 21A to modulator 30. Controller 46 also 

t3. insertion controller 46 switches transmission switch 25. transmits return trigger to logic 41 In response, logic 42 

via lines 34. to the down position as viewed in FIG. 3. At this exits decision STEP 78 on the YES path and returns the 

point, the advertisement video and audio channels are process back to monitor STEP 70. 

switched onto output lines 33 of switch 25 for transmission FIG. 5B, which is similar to FIG. 5 A. depicts a modified 

to the subscribers. As such, adder 39 now monitors and adds 65 normalizationprocess in which the audio level offset of each 

the left and right channels of advertisement audio source commercial is stored on its first playing so that upon 

24A. subsequent playing, the volume correction of that commer- 
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rial may be set to an approximately correct value without the a channel is shared during different times. During the day the 

need to play, monitor and measure the commercial's level. channel might be used for educational programming and 

In performing the FIG. 5B process on the program- during the evening the channel might be used for pay 

commercial sequence of FIG. 4, logic 42 performs STEPS programming. This necessitates the switching of inputs to 

70-75 to obtain the target level. At this point, logic 42 5 the receiver channel If the audio levels from the two 

determines, in decision STEP 69. if the commercial about to different sources are different, which is the rule rather than 

be played, e.g. commercial CL has been previously played. the exception. CTV system 20*. an alternate embodiment 

To do this, logic 42 looks up table entries in memory 43 that normalizes these differences. 

have been previously stored, listing commercial identifica- FIG. 6 illustrates CTV system 20* which monitors the 

tion (ID) data, corresponding audio level errors Ep and its ]Q level of a program segment, e.g. program segment PI. and 

related target level. Controller 46 provides logic 42 with the adjusts its level to match the nominal commercial reference 

appropriate ID data. If this is the first playing of commercial level LI. Additionally, the commercials have their levels 

C 1, the process exits STEP 69 along the NO path to calculate corrected, if necessary, as they begin to play. The advantage 

error El . in calculate STEP 76. and proceeds to adjust STEP of this embodiment is that if a program level changes as a 

77 to adjust attenuators 55 and 56. Next, logic 42 stores, in 15 result of uplink errors or system gain changes, the level 

store STEP 68. appropriate table entries in memory 43. correction subsystem automatically corrects the match for 

namely, the ID data for commercial CI. the corresponding these errors and/or changes. 

error Ep equal to the current error El and its related target Jq addition to the elements that make up CTV system 20, 

level. If logic 42 receives a return trigger in de cision STEP CTV system 2ff has three attenuators 65-67 in the program 

78,itexitsmeYESr>athandrettnTistoinoiutorSTEP7©vk ^ audio path. More specifically, CTV system 20' shows attenu- 

index STEP 86 which increments index n. However, i f logic ators 65. 66 and 67 connected in the respective left, right and 

42 should receive a next-commercial trigger in trigger STEP SAP channels of program audio source 21A. Attenuator 

79, the process returns to decision STEP 69, via index STEP control lines 38 connect logic 42 to the control terminals of 

80 which increments index n. and the process repeats fox program attenuators 65. 66 and 67. By adding program 

commercial C2. ^ attenuators 65-67. CTV system 20' functions to normalize 

However, if the commercial to be played has played errors in the program audio levels. As such. CTV system 20' 

previously, the process exits the YES path from decision automatically normalizes unwanted variations in program 

STEP 69. Logic 42 uses the commercial's ID data to levels that may be caused by. for example, inconsistent 

accesses the appropriate table entry in memory 43. In adjustments made by those who operate CTV head-ends, or 

calculate STEP 63. logic 42 calculates error Er using the 30 audio gain variations in the head-end that change with time 

current target level, error Ep, and its related target level. or temperature. 

Logic 42. in adjust STEP 64, next adjusts attenuators 55 and The audio level normalizing technique of CTV system 20' 

56 using audio level error Er. The process then moves to parallels that of CTV system 20. except that system 20' 

decision STEP 78 and proceeds further in the manner contains enough controls to allow the audio of program 

described above. 35 segments PI and P2, and the audio of commercials CI and 

Pre-correcting the loudness setting for the commercials, C2 to be set to reference level LI. This process will now be 

as just described with respect to FIG. 5B, requires that tables described with reference to FIG. 6, the curves in FIG. 7 and 

be constructed for all commercials in the system. Since the FIG. 8 flow chart At the start monitor STEP 9$ initiates 

insertion controller 46 usually provides all commercials with the monitoring of the difference values outputted by A/D 

ID data uniquely associated with that commercial, logic 42 40 converter 44. In define STEP 92. logic 42 sets the stored 

can readily organize the table using that ID data. In the value of the target level to be equal to reference level LL 

preferred embodiment, the table is duplicated for each which FIG. 7 shows to be within the range of commercial 

channel on which the commercial airs. This will accommo- CI. Upon receiving pre-roll trigger, in trigger STEP 81, 

date any differences in the gain of comparison amplifier 32 logic 42 performs store STEP 82. trigger STEP 83 and 

or other errors from channel to channel 45 calculate STEP 84 to find the average difference D at the 

Many other modifications and variations are possible in output of A/D converter 44 over time period t2-t3 (or other 

the light of the above teachings. For example, in order to time period as described above). Logic 42 uses this average 

allow the above-described level setting activities to proceed difference D. which represents the difference between the 

without undue computational stress being required of logic target level (equal to reference level LI) and the audio level 

42, level detector 40 is preferably configured to output the 50 of the last portion of program segment PI. as an audio level 

measured level differences in decibels (dB). a well-known error signal fox adjusting program attenuators 65 and 66. 

operation which converts ratio (division) operations into FIG. 7 shows the original position of program segment P2' 

addition. Those skilled in these arts can readily design the being shifted to the new position of program segment P2 as 

output voltage of level detector 40 to be proportional to the a result of these adjustments. 

dB level of the audio differences. FIG. 13. to be described 55 After adjusting program attenuators 65 and 66. in adjust 

below in detaiL includes a specific implementation of level STEP 87, logic 42 exits the NO path of decision STEP 88 

detector 40 with a dB output Attenuators 55-n57, therefore, and proceeds to set STEP 89. At this point, logic 42 stores 

attenuate the audio signals in a conventional dB relationship jn memory 43 the output of A/D converter 44 as audio level 

with respect to errors En. Although the conversion to dB error En, which at this point in the present example equals 

representation may be performed intrinsically In level detec- 60 error El. Logic 42 adjusts, in adjust STEP 92. commercial 

tor 40. as depicted herein, it is possible and often preferable attenuators 55 and 56 using audio level error El. Because it 

to take the functions of audio reference level circuit 41 and was assumed that commercial CI was recorded substantially 

comparison amplifier 32 into the digital domain by digitiz- at the nominal advertisement recording reference level LI. 

ing the output of level detector 40. audio level error El equals zero, making attenuator adjust- 

A source of error in the audio level output to the sub- 65 ments at this point unnecessary. In the absence of receiving 

scribers occurs when the input sources at receiver 21 change a return trigger, in trigger STEP 93. logic 42 follows the NO 

from time to time. Such will be the case, for example, when path to trigger STEP 95 via index STEP 94 where index n 
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is incremented. Upon receiving the next-commercial trigger sync tip amplitude. It is known to those skilled in the art how 

from controller 46. in trigger STEP 95, logic 42 stores in to measure sync amplitude. In addition, if a second set of 

memory 43 audio level error E(n+1) as being equal to the level control attenuators arc placed in the program video 

current output of A/D converter 44. At this point in the source output such as in lines 22 of FIGS. 3 and 6, they 

present example, logic 42 stores audio level error E2 as the 5 could be used to correct for incorrect video level coming 

difference between the audio level at the start of commercial from that program video source. 

C2 and the target level. Le., reference level LL As such. FIGS. 9 and 1© show yet another embodiment Here CTV 

adjust STEP 92 performs an adjustment to advertisement system 20" corrects audio-level differences that may be 

attenuators SS and 56 so that commercial C2 will play at or introduced by audio-level adjustments contained in conven- 

near reference level LI. This process is repeated for sue- 1Q tional modulator equipment A control device on a conven- 

ceeding coinmexcials. tional modulator, e.g.. modulator 30, is frequently labeled 

When logic 42 receives a return trigger from controller "DEVIATION," because it controls the deviation of the 

46 at time t4 in decision STEP 93, it returns the process to sound carrier: the greater the audio level the greater the 

trigger STEP 81 to await reception of the next pre-roll deviation. If the operators set the modulator deviations 

trigger at, for example, time t5. This sequence takes place l5 inconsistently from one channel to the next the associated 

via the YES path of decision STEP 93. When controller 46 audio levels will change when subscribers are tuning chan- 

transmits the next pre-roll trigger at time t5. logic 42 nels.Thus, controlling the deviation (sound level) between 

proceeds through STEPS 81-84, thereby obtaining a new channels of a multi-channel CTV system 20". as depicted in 

average difference D for the final portion of program seg- FIGS. 9 and 10, creates further normalization problems, 

ment P2 over time period i5-t6. FIG. 7 shows the new ^ The normalization technique of CTV system 201" paral- 

average difference D, over period t5-t6, to be significantly lels that of CTV system 20' of FIG. 6, with a notable 

smaller than the difference D* that would have occurred exception that level detector 40 derives its audio input 

absent the prior adjustments made to program attenuators 65 signals form different points. CTV system 20 derives its 

and 66 in adjust STEP 87 at time t3. audio level information from the individual subscriber 

The level adjusting procedures so far described measure 25 channels, viz.. lines 33. On the other hand, CTV system 20" 
the average level of the program segment over a short time derives its audio level information from a point common to 
period at the end of the segment, e.g.. between the pre-roll all subscriber channels. More specifically, while CTV sys- 
and transfer cues. In the FIG. 7 scenario, this procedure tern 20' uses the left and right audio channels outpuued by 
results in program segment PI running at a level much lower receiver 21 and ad-insertion system 24 via adder 39. system 
than target level LI. It is also noted that in the embodiments 30 20" derives the audio from the main subscriber lines 101. 
so far described, logic 42 adjusts the attenuators, e.g.. Level processor circuit 26* of system 2©", employs tune- 
attenuators 55-57 and/or 65-67. within a relatively short able demodulator 60 to selectably monitor the audio outputs 
time period that would normally go undetected by the of the various subscriber channels. The audio input of 
viewer. For example, after the average level of program demodulator 60 connects to the main subscriber lines 101 
segment PI in FIG. 7 is detected over time period t2-t3, the 35 via directional coupler 61. When monitoring a particular 
level of commercial CI is adjusted (if necessary) within a channel, demodulator 60 tunes to the output of the appro- 
short period that should go undetected by the listening or priate channel modulator, e.g., modulator 30A or an eqiriva- 
viewing audience. However, these functions can be modified lent audio modulator (not shown) in one of the other 
in some situations by measuring the average program levels channels 100. A preferable implementation of demodulator 
over one or more other time periods as the program segment 40 60 includes a conventional agile television demodulator with 
runs while making small periodic attenuator adjustments to a calibrated audio output and a tuner suitable for control by 
gradually bring the program ox conimercial level into line controller 46. In the television system used in North America 
with the target level. Using this modified procedure with the (NTSQ, the audio on main subscriber lines 101 deviates the 
FIG. 7 scenario, the first program segment PI can be audio carrier by 25 Irilchertz (KHz). In this regard, demodu- 
gradually brought into line with target level LI well before 45 iator 60 is preferably calibrated such that its output voltage 
receiving the pre-roll cue at time t3. The application of this equals a known value for this 25 KHz deviation. As such, 
steady level adjustment process will be described below in this known value corresponds to audio reference level LI. 
greater detail with respect to the embodiment of FIGS. 9 and Because demodulator 60 produces a monaural sum signal. 
10. level processor circuit 26' does not include an adder similar 

It has been noted above that the teachings of this invention 50 to adder 30 of systems 20 and 20'. 
apply equally well to other types of signals, including video. For CTV system 20". the cue tones received by commer- 
data. control, etc. Video levels are usually easier to control cial insertion controller 46 transmit on line 23 from program 
than are audio levels because a well defined relationship video source 21V and on lines 23* from similar equipment 
exists between the desired picture information and its sync (not shown) in the other channels 109. In addition to timing 
signal level Because the sync signal level is normally 55 information, these cue tones include channel tuning infor- 
consistent from one program segment to all others, it may be mation which insertion controller 46 uses to direct the 
easily used to set video level. However, a few cases exist in ad-insertion process of ad-insertion system 24\ In this 
which it is desirable to automatically set the level of the regard, ad-insertion system 24' is a conventional system 
video signal other than using the sync signals. One important having multiple channels for simultaneously inserting corn- 
instance occurs when different video sources are being 60 mercials in more than one subscriber channel via lines 27 
switched to a common output channel, or when video system and 27'. Additionally, insertion controller 46 uses the chan- 
gain changes over time. nel tuning information to automatically tune audio demodu- 

Therefore, if level control attenuators are placed in the lator 64> from one subscriber channel to another. When a 

output of an advertisement video source, such as in lines 27 program segment e.g., program segment PI of FIG. 2, nears 

of FIGS. 3 and 6, they can be used to match the video level 65 its end on any channel, commercial insertion controller 46 

of a commercial to that of the program video source. A tunes audio demodulator 60 to that channel for performance 

detector would detect the level of the video by measuring of its appropriate normalization process. 
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The audio level normalization control loop for each known that the pilot level on stereo channels is not auto- 
channel in CTV system 20" includes attenuators 55-57 and matically set correctly. After a stereo encoder generates such 
65-67. and stereo encoder and audio modulator 30 A. These pilot levels, they are modulated onto the audio carrier. In 
items will be unique to each channel. This normalization some equipment the adjustment of the pilot level is a factory 
control loop also includes level processor circuit 26' which 5 calibration, and in other equipment is a field adjustment. It 

comprises demodulator 60. level detector 40. amplifier 32. is possible to add to level processor 26*. for example, a 

A/D converter 44 and logic 42. These items are shared by all channel tuned to the standard 15.734 KHz pilot frequency, 

of the channels. Level processor circuit 2S normalizes the The level of the pilot could then be measured. Due to the 

program audio from all channels, including other channels way equipment is normally built, it will not be readily 

100. and commercial audio being routed through all chan- possible to automatically correct the pilot level, but a visual 

nels from ad-insertion system 24*. As such. CTV system 20" or audible indicator could notify an operator of a problem, 

offers a closed-loop control of audio level that includes M sug gested above, those skilled in the art are aware that 

normalization of ^mismatched levels due to several sources measuring audio level is extremely difficult, in part by virtue 

including the audio modulators. of me fact that normal prograrnmkg includes constantly 

Having the dements of tuner processor circuit 26" com- varying audio levels, possibly covering a very large dynamic 

mon to all channels both minimizes cost and promotes more " Fof a a loud 

consistency from one channel to another. The use of com- *Ts ^ . . . , T„ . . , 

mon equipment does, however, preclude the measurement of ^tween two inchv.duals, foUowed by an abaost silent scene 

the audiolevel just before a commercial on a plurality of J*"!*? out of * c h ° use and through woods, 

channels. Consequently, a queuing routine must be used for l } n ****** m a such « ^ Ration 

defining priority chaonels as the insertion process proceeds. 20 dunn S * c argument, letting the audio level during the 

The present invention contemplates that insertion controller runnm 8 s**™ faU where * e director's artistic judgment 

46 be programmed to define and select priority channels to P 1 *? 6 Adjusting to a louder signal (higher deviation) 

which audio demodulator 60 is tuned just prior to a com- during the running scene, is not appropriate. An algorithm 

merciai. As mentioned above, conventional insertion con- f° r gradually adjusting audio level (and for determining the 

trailers know the time that commercials will play within a 25 matcn ^vel of a commercial) is programmed into the 

minute or two in most cases. As such, some channels can be software of logic 42 and controller 46. Although various 

monitored between commercials and their audio deviation algorithms are possible, one that experience teaches may be 

optimized for average audio over a period other than the most appropriate with respect to system 20" of FIGS. 9 and 

short period just before the commercial (e.g. , other than time 10 will now be described. 

period t2-t3). In practice, this solution will produce sufficient M During program content, loudness is periodically moni- 

normalization in most cases because audio levels do not tared over several periods and attenuators 65 and 66 are 

change particularly fast adjusted while a program segment runs. A one minute (for 

Further, when commercials begin simultaneously on sev- example) timer in logic 42 starts each time level detector 40 

eral subscriber channels (a common occurrence), insertion indicates that the audio level has reached reference level LI. 

controller 46 selects these channels sequentially according 35 If the audio level reaches reference level LI more man, for 

to pre-defined priority routines. Although insertion control- example, five times during a minute, the level is turned down 

ier 46 may assign tuning priorities for audio demodulator 60 by increasing the attenuation of attenuators 65 and 66. The 

according to any number of queuing routines, including a adjustment is done in a minimum control increment so as not 

random assignment routine, a preferred priority routine for to be audible to the listener. For example, an increment of 

making priority assignments is as follows: the channels to be 40 0.5 dB is suitable. On the other hand, if the audio does not 

given the highest priority are chosen from a random selec- reach the maximum level at all during the minute, the 

tion of the channels whose level offsets are unknown (i.e., volume is increased by adjusting attenuators 65 and 66. Hie 

they have never been measured and stored); followed by an volume is increased by the same minimum amount, which is 

ordered selection of the remaining channels based on the not detectable by the listener. If another minute elapses 

length of time that its commercial has last played, with those 43 without the audio level reaching the reference level LI . logic 

having the longer periods given the higher priority. Using 42 adjusts attenuators 65 and 66 so that the audio level is 

this priority routine, commercials that have not yet played again increased by that minimum amount Such an algo- 

and. therefore, have not yet been normalized are given top rithm will result in the volume level ultimately being set to 

priority. Because there is more opportunity for equipment reference level LI as close as possible, 

level settings to drift over time, priorities are next assigned 50 Another algorithm would monitor the level with respect to 

based on the length of time since the commercial last played; reference level LI. and any time the audio level reaches 

the one with the longest time period of non-play assigned the reference level LI, logic 42 adjusts attenuators 65 and 66 so 

top priority. When mere is a conflict in priorities, the routine that the audio level is reduced almost instantaneously until 

makes random assignments from those involved in the it drops below reference level LI. If reference level LI is not 

conflict - 55 reached again for some length of time, such as one minute. 

Quality control functions are possible for systems of the logic 42 adjusts attenuators 65 and 66 so that the audio level 

type shown in FIGS. 9 and U0. It is known by viewers of increases by the same small increment as above. The 

cable television that the audio level from one channel to increase is repeated at intervals of a minute or so. until 

another is not always consistent This is usually caused by reference level LI is again reached. This type of algorithm 

head-end set-up errors. The set-up errors are. in turn, caused 60 forms a fast attack, slow decay volume control strategy, 

by excessive work load or insufficient training of personnel. which is often preferred for volume control downstream of 

or by limitations in equipment By sequentially monitoring where the artistic qualities of the program are set In general 

the level on all channels, the audio level of all channels can this is the type of algorithm that is favored in this circum- 

be made the same, solving a long-standing complaint against stance. 

conventional cable systems. 63 In certain circumstances the artistic qualities may be 

Another quality control function is contemplated using changed by bringing the volume up more rapidly if it does 

the standard pilot level on stereo channels. It is generally not reach reference level LI. This changes the control 
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strategy to what is often called "compression It is generally 
not preferred unless typical listening conditions are different 
from those anticipated during creation of the program. 

Each audio transmission, be it part of a commercial or an 
eatertainment program, has a characteristic audio-level sig- 
nature that may be derived from level detector 4G. FIGS. 11 
and 12 illustrate a verification system that uses such audio- 
level signatures to automatically determine if the content of 
the transmission corresponds to the content that was 
intended. This function is of particular importance in adver- 
tising where it is desirable to verify that a particular com- 
mercial was transmitted at the proper time on the proper 
channel. 

Referring now to FIGS. 11 and 12. verification system 
109 includes digital sampler 110 which outputs a unique 
series of audio level samples over predefined time periods 
for each commercial. EM glial processor 111 receives these 
samples from sampler 110 while receiving commercial ID 
data and trigger pulses from insertion controller 46 in 
monitor STEP 120 and data STEPS 121. Processor 111 also 
connects to memory 112, correlation detector 113 and alarm 
114. Generally, processor 111 stores the signatures for all 
commercials that are to be played in a sig nature table in 
memory 112. via the YES path of decision STEP 122 and 
store STEP 123. These signatures are stored, along with their 
commercial ID data, the first time that each commercial is 
played by the CTV system, or, alternatively, they may be 
imported to memory 112 from data obtained when the 
commercials are recorded. Processor 111 performs corre- 
sponding auto correlations on each of the signatures and 
stores the auto correlation results in the signature table. Still 
further, for each commercial, processor 111 finds the stan- 
dard deviation for the set of samples that make up its 
signature. These standard deviation values, which are mea- 
sures of the extent that the set of samples of each signature 
deviates from its mean, are also calculated and stored in the 
signature table during store STEP 123. 

Later, when a particular commercial plays, processor HI 
stores, in store STEP 125, its commercial ID data and 
signature in a verification table to be used for verification. To 
verify that a particular commercial that was played, say 
commercial OC was the desired commercial, say commer- 
cial CI. processor 111 retrieves the CX signature of the 
unknown commercial from the verification table. Processor 
111 performs, in calculate STEP 126. a correlation of the 
unknown signature CX with the known signature CI. Pro- 
cessor 111 then compares, in decision STEP 128, the auto- 
correlation Cl/Clwith the correlation Cl/CX and the stan- 
dard deviation of commercial CI. If the autocorrelation 
Cl/Cland correlation Cl/CX are of comparable value and 
high as compared to the standard deviation of commercial 
CI. then there is a strong likelihood that commercial CX 
corresponds to commercial CI. However, if the autocorre- 
lation Cl/Clis high compared with the standard deviation of 
commercial CI while correlation Cl/CX is low compared 
with the standard deviation of commercial CI. processor 111 
fails to verify commercial CX and activates alarm 114. Of 
course, the autocorrelation results of the stored signatures 
need not be stored in signature table, but instead calculation 
STEP 126 may include the process of obtaining both the 
desired autocorrelation Cl/Cland the apparent cross- 
correlation Cl/CX before obtaining verification via verify 
STEP 128. 

To maximize system performance, it is contemplated 
further that level detector 40 preferably measure the sub- 
jective loudness of the audio. This contrasts with other 
measuring options, such as measuring peak audio levels. In 
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the instant invention level detector 40 preferably obtains a 
metric of the audio level that correlates well with the way 
humans subjectively perceive the loudness of audio program 
materiaL 

5 Measuring the subjective loudness of an audio signal is a 
difficult undertaking. One of the earlier known standards for 
doing this is the volume unit (VU) meter developed coop- 
eratively by the telephone and broadcast industries in 1939, 
and still in use today. The VU meter standard was developed 

io based on the capabilities of the D'Arsonval meter move- 
ments available at that time. That the standard remains in use 
today is more of a testament to the difficulty of measuring 
subjective loudness, than it is to the validity of the standard 
as a measure of loudness. Those skilled in these arts nor- 

is mally use this standard only for general guidance while 
imposing their own subjective judgment as to how loud a 
signal sounds. 

The European Broadcast Union (EBU) has for some years 
used a peak reading meter to measure the loudness of a 

20 signal. Some authorities propose that the EBU method is 
superior to using a VU meter because it relates nicely to the 
needs of engineers monitoring the peak level of a signal for 
transmission purposes. However, researchers have sought 
better methods for measuring an audio signal that correlate 

25 with subjective loudness. FIG. 13 shows a schematic of a 
preferred circuit suitable for use as level detector 40 for 
measuring subjective loudness of an audio signal. 

Level detector 4® is a particularly critical circuit, in mat 
it must produce a voltage that varies proportionally to the 

30 subjective loudness heard by a typical listener. Level detec- 
tor 4©. as depicted in FIG. 13. combines a close approxi- 
mation of a standard A-weighting curve, such as shown in 
FIG. 14. with a factor reflecting the observation mat fast, 
transitory sound is not perceived as loudly as would be the 

35 same sound if it lasted longer. These concepts are discussed 
in the following publications: Benson, Audio Engineering 
Handbook, 1988, pp 1-38 to 1-39; and Burden, et al., "A 
Different Approach to the Old Problem of Audio Level 
Monitoring" 84th Convention of the Audio Engineering 
Society. March 1-4, Paris. 1988, pp 1 to 8. 

The A-weighting curve plots response vs. frequency such 
that its values closely resemble the typical response of the 
human ear. Filtering circuits having a response that approxi- 

45 mates the A-weigbting curve have been used in audio level 
measurements as a means of pre-filtering a signal such that 
subsequent detection of volume level will be roughly equal 
to the frequency response exhibited by the human ear. 
Further, it is known that humans generally perceive loudness 

so partially as a function of duration of the sound, with tran- 
sitory signals sounding progressively louder until the dura- 
tion exceeds about 200 micro-seconds. Beyond this period, 
the perceived loudness no longer varies as a function of 
duration. 

55 FIG. 13 depicts the specific circuit elements for an imple- 
mentation of level detector 40 that was tested and compared 
to an A-weighting curve as shown in FIG. 14. The circuit 
schematic of FIG. 13 places parameter values for the various 
elements adjacent the dement symbols. In FIG. 13. capaci- 

50 tance values are given in microfarads and resistance values 
are in ohms. The reference markings adjacent diodes D1-D4 
and amplifiers U1A, U1B, U1C. U1D and U2A are conven- 
tional identifications of these elements. Transistor Ql is a 
conventional NPN transistor. 

65 The audio signal monitored by level detector 44) appears 
at input tenuinal 114®. Amplifier U1A and the associated 
elements constitute an active band-pass filter 141. Generally. 
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filter 141 is a conventional circuit with the addition of 
capacitor C3 which creates a portion of the rolloff required 
by the A- weighting curve. From filter 141. the signal passes 
to low-pass filter 142 comprising amplifier U1B and the 
associated components. Again, this circuit is known to those 5 
stalled in the art High-pass filter 143. comprising capacitor 
C5 and resistor R8. provides a portion of the rolloff at low 
frequencies required by the response of the A-weighting 
curve. 

The output of high-pass filter 143 passes to full wave 10 
rectifier 144. consisting of amplifiers U1C and U1D and 
their associated components. Again, this circuit is familiar to 
those skilled in the art The output at pin 8 of U1C is a full 
wave rectified representation of the audio signal at input 
terminal 14© after it has passed through filters 141-143. The is 
R-C circuit, formed by resistor R16 and capacitor C6, limits 
the attack time of the circuit to limit the effect on the audio 
level of a momentary high level sound. The time constant of 
the R-C circuit is much shorter than the maximum duration 
proposed by the Benson publication cited above. This is 20 
done to allow a certain degree of fast attack characteristic to 
pass to the measuring facility. 

Finally, dB circuit 145, comprising amplifier U2A and 
associated components, converts the output of rectifier 144 
to decibel or logarithmic representation. This circuit is also 25 
known to those skilled in the art Output terminal 146 passes 
the signal to comparison amplifier 32. FIG. 14 shows the 
A-weighting curve as a solid line and the test values of the 
filter response of level detector 40 as X's. As can be seen 
from inspection, there is a close correlation between the 30 
responses of the A-weighting curve and the test results of the 
FIG. 13 implementation of level detector 40. 

It is to be understood, therefore, that within the scope of 
the appended claims, the invention may be practiced other- 
wise than as specifically described. 35 

What is claimed is: 

1. A signal processing system having a normalized output 
signal comprising: 
a first signal source; w 
a second signal source; 

a signal combining means connected to said first and 
second signal sources for forming an output signal by 
linking signal segments derived from said first and 
second signal sources into a series of said signal 45 
segments; 

a level processor means connected to said signal combin- 
ing means for detennining a level of intensity of said 
output signal, wherein said level processor means gen- 
erates a target level and an error level related to the so 
difference between said target level and said level of 
intensity of said output signal and. wherein said level 
processor means includes level storage means fox stor- 
ing said error level and the corresponding source of 
said signal segment; and 55 

a level adjusting means connected to at least one of said 
signal sources and responsive to said level processor 
means for adjusting a level of intensity of said signal 
segments from said at least one of said signal sources 
such that said level of intensity of said output signal is 60 
normalized, wherein said level adjusting means adjusts 
said level of intensity of said signal segments as a 
function of said error level, and further wherein said 
level adjusting means adjusts said level of intensity of 
said signal segments as a function of corresponding 65 
ones of said error level stored in said level storage 
means. 



is 

2. The system of claim 1 wherein said target level is a 
function of said level of intensity of said output signal for 
signal segments derived from said first signal source. 

3. The system of claim 1 wherein said level processor 
means includes a reference level means for generating said 
target level at a fixed predetennined value. 

4. A signal processing system having a normalized output 
signal comprising: 

a first signal source; 
a second signal source; 

a signal combining means connected to said first and 
second signal sources for forming an output signal by 
linking signal segments derived from said first and 
second signal sources into a series of said signal 
segments; 

a level processor means connected to said signal combin- 
ing means for determining a level of intensity of said 
output signal, wherein said level processor means fur- 
ther includes a reference level means for generating a 
predetermined fixed reference level, and said level 
processor means determines an error level related to the 
difference between said level of intensity of said output 
signal and said fixed reference level; and 

a level adjusting means connected to at least one of said 
signal sources and responsive to said level processor 
means for adjusting a level of intensity of said signal 
segments from said at least one of said signal sources 
such that said level of intensity of said output signal is 
normalized and, said level adjusting means further 
connected to said first and second sources for adjusting 
said level of intensity of said signal segments at cor- 
responding ones of said signal sources. 

5. The system of claim 4 wherein said level processor 
means determines said level of intensity of said output signal 
for signal segments derived from said first signal source, and 
said level adjusting means adjusts said level of intensity of 
said signal segments at said second signal source as a 
function of said level of intensity of a preceding signal 
segment from said first signal source. 

6. A signal processing system having a normalized output 
signal comprising: 

a first signal source, wherein said first signal source 
includes cue means for marking borders of said signal 
segments; 

a second signal source, said second signal source being 
responsive to said cue means for generating signal 
segments and for causing said signal combining means 
to link said signal segments at said borders; 

a signal combining means connected to said first and 
second signal sources for forming an output signal by 
linking signal segments derived from said first and 
second signal sources into a series of said signal 
segments; 

a level processor means connected to said signal combin- 
ing means for determining a level of intensity of said 
output signal, wherein said level processor means gen- 
erates a target level and an error level related to the 
difference between said target level and said level of 
intensity of said output signal; and 

a level adjusting means connected to at least one of said 
signal sources and responsive to said level processor 
means for adjusting a level of intensity of said signal 
segments from said at least one of said signal sources 
such that said level of intensity of said output signal is 
normalized, wherein said level adjusting means adjusts 
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said level of intensity of said signal segments as a 
function of said error level. 

7. The system of claim 6 wherein said level adjusting 
means adjusts said level of intensity of said signal segments 
at said borders of said signal segments. 

8. The system of claim 6 wherein said first and second 
signal sources include audio signals and said level of inten- 
sity corresponds to the loudness of said audio signals. 

9. A signal transmission system for producing a composite 
output signal formed by linking signals from a plurality of 
signal sources comprising: 

a first signal source means for generating a series of first 
signal segments and cue tones indicating the borders of 
said first signal segments; 

a second signal source means connected to said first signal 
source means and responsive to said cue tones for 
generating a series of second signal segments; 

a signal combining means connected to said first and 
second signal source means for forming said composite 
output signal by alternately linking said first signal 
segments with said second signal segments; 

a level processor means connected to said signal combin- 
ing means for determining a level of intensity of said 
composite output signal; and 

a level adjusting means connected to at least one of said 
signal source means and responsive to said level pro- 
cessor means for adjusting a level of intensity of signal 
segments from said at least one of said signal source 
means such that said level of intensity of said output 
signal is normalized. 

10. The system of claim 9 wherein said first signal 
segments and said second signal segments include audio 
signals and said level of intensity corresponds to the loud- 
ness of said audio signals. 

11. The system of claim 10 wherein said level processor 35 
means determines the loudness of a portion of said first 
signal segments, and said level adjusting means adjusts the 
loudness of said second signal segments as a function of said 
loudness of said portion of said first signal segments. 

12. The system of claim 11 wherein said level processor 40 
means generates a target loudness level and an error level 
related to the difference between said target loudness level 
and the loudness of said output signal, and said level 
adjusting means adjusts said loudness of said output signal 
as a function of said error level. 

13. The system of claim 12 wherein said target loudness 
level is a function of said loudness of portions of said first 
signal segments. 

14. The system of claim 12 wherein said level processor 
means includes a reference level means for generating said 50 
target level at a fixed predetermined value. 

15. The system of claim 12 wherein said level processor 
means includes level storage means for storing said error 
levels, and said level adjusting means adjusts the loudness of 
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18. The system of claim 17 wherein said composite output 
signal includes a plurality of said second signal segments 
linked between successive ones of said first signal segments, 
said level processor means determines the loudness of each 
of said plurality of said second signal segments, and said 
level adjusting means adjusts said loudness for each of said 
plurality of said second signal segments. 

19. The system of claim 18 wherein said level processor 
means includes signal signature means for storing predeter- 
mined loudness signatures for said signal segments, and for 
correlating a plurality of samples of said loudness for one of 
said signal segments with a corresponding one of said 
signatures to verify the transmission of said one of said 
signal segments. 

). The system of claim 19 wherein said level processor 
means includes loudness detector means having a frequency 
response that substantially resembles the typical response of 
a human ear. 

21. A signal transmission system for transmitting an 
20 output signal comprising: 

a plurality of signal channels* each said channel compris- 
ing: 

a first signal source means for generating a series of 
first signal segments and cue tones indicating the 
borders of said first signal segments; 
a second signal source means connected to said first 
signal source means and responsive to said cue tones 
for generating a series of second signal segments; 
a first signal combining means connected to said first 
and second signal source means for forming a chan- 
nel output formed by alternately linking said first 
signal segments with said second signal segments; 
a second signal combining means connected to each of 
said first signal combining means for combining said 
channel outputs for simultaneous transmission; 
a level processor means connected to said second signal 
combining means for detennining a level of intensity of 
signals in one of said *-h«Tin>p.1 outputs; and 
a level adjusting means connected to at least one of said 
signal source means in each said channel and respon- 
sive to said level processor means for adjusting a level 
of intensity of said signal segments from said at least 
one of said signal source means in one of said channels 
such that said level of intensity of said channel output 
is normalized. 

22. The system of claim 21 wherein each said first signal 
combining means includes a modulator means for modulat- 
ing said channel output onto a carrier signal, and said level 
processor means includes a tunable loudness detector means 
for selectively detecting loudness for each of said channel 
outputs. 

23. The system of claim 22 wherein said level processor 
means determines said level of intensity of said signal 
segments of one of said channel outputs, and said level 
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said error levels stored in said level storage means. 

16. The system of claim 12 wherein said level adjusting 
means is connected to said first and second source means for 
adjusting said loudness of said first signal segments and said 
second signal segments. 60 

17. The system of claim 12 wherein said first signal source 
means includes cue means for marking borders of said first 
signal segments, and said second signal source means being 
responsive to said cue means for generating said second 
signal segments and for causing said signal combining 65 
means to link at least one of said second signal segments to 
said first , signal segments at said borders. 



segments for the other of said channel outputs as a function 
of said level of intensity of said one of said channel outputs. 

24. A signal processing method for producing a normal- 
ized output signal comprising: 

generating a first signal from a first signal source; 
generating a second signal from a second signal source; 
combining said first and second signals to form an output 
signal by linking signal segments from said first and 
second signals into a series of signal segments; 
determining a level of intensity of said output signal 
including determining said level of intensity of said 
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output signal for signal segments derived from said first determining a level of intensity of said composite output 

signal and generating a target level and an error level signal; and 

related to the difference between said target level and adjusting a level of intensity of signal segments such that 

said level of intensity of said output signal; and ^ | evd of intensity 0 f sa id composite output signal is 

adjusting a level of intensity of said signal segments to 5 normalized 

produce said normalized output signal including adjust- 30. The method of claim 29 wherein said steps of gener- 

ing said level of intensity of signal segments from said atin g a of ^ signal segments an( i generating a series 

second signal as a function of said level of intensity of of signal segments includes generating audio signals, 

a preceding signal segment from said ^ first signal, and ^ saM lcycl of t0 me loudness of 

further including adjusting said level of intensity of 10 $jdd 

said signal segments as a function of said error level. ^j^nS'ct claim 29 wherein said detennining step 

25. The method of claim 24 wherein said detennining step . , * . . TT , ' y~; 7^ * \ 
includes generating said target level as a function of said toc ™^g the loudness for a portion of said first 
level of intensity of said output signal for signal segments *W? ^ d said adjusting step includes adjusting 
derived from said first signal. " me loudncss of smd *«Md s 'gnal segments as a function of 

26. Hie method of claim 24 further including storing said said loudness of said portion of said first signal segments, 
error levels and the corresponding source of said signal 31 The method of claim 3® wherein said determining step 
segments, and wherein said adjusting step includes adjusting includes generating a target loudness level and an error level 
said level of intensity of said signal segments as a function related to the difference between said target loudness level 
of said error levels stored in said storing step. 20 and the loudness of said composite output signal, and said 

27. A signal processing method for producing a normal- adjusting step includes adjusting said loudness of said output 
ized output signal comprising: signal as a function of said error level. 

generating a first signal from a first signal source; 33. The method of claim 32 further including storing said 

generating a second signal from a second signal source; e™* levds - afld wherein said adjusting step includes adjust- 

combiaing said first and second signals to form an output *f loudncss °f f** signal segments as a function of said 

signal by linldng signal segment from said first and cm ^M m ^ Mta^^ 

second signals into a series of signal segments; ™ e of clami 29 whercin ad J ustlll 8 stc P 

deterrnining a level of intensity of said output signal. ^^f^V^U^^^^^^^^ 

wherein said deter mining step includes generating a 30 ccono "gflai- 

predetermined fixed reference level and detennining . ^ememcdof daim 30 wh«m 

an error level related to the difference between said boring predetermined loudness signatures for said 

level of intensity of said output signal and said fixed sl S nal segments, and correlating one a plurality of samples 

reference level* and °f said loudness for one of said signal segments with a 

adjusting a level of intensity of said signal segments to 35 corresponding ; one of said loudness signatures to verify the 

produce said normalized output signal, said adjusting transmission of said one of said signal segments, 

step including adjusting said level of intensity of said M ' ^ of claim 35 wherein said detennining step 

signal segments for corresponding ones of said first and includes detecting said loudness as a function of the fre- 

second signals. quency response of the typical response of a human ear. 

28. A signal processing method for producing a normal- 40 37. A signal transmission method for transmitting an 
ized output signal comprising: output signal comprising: 

generating a first signal from a first signal source, wherein generating composite output signals in each of a plurality 

said step of generating a first signal includes generating of signal channels comprising: 

cue tones for marking borders of said signal segments; generating a series of first signal segments and cues 

generating a second signal from a second signal source. 45 indicating the borders of said first signal segments; 

wherein said step of generating a second signal generating a series of second signal segments in 

includes generating signal segments responsive to said response to said cues; and 

cue tones for combining said signal segments at said f<nmu, S • channel output by alternately lintang said 

borders. first signal segments with said second signal seg- 

.. . „ , 50 meats at said borders in response to said cues; 

combining said first and second signals to form an output ... ... . . . . 

signal by linking signal segments from said first and combining said channel outputs for simultaneous trans- 

second signals into a series of signal segments; mission; 

determining a level of intensity of said output signal; and <letonimmg * level of intensity of signals in one of said 

channel outputs; and 

adjusting a level of intensity of said signal segments to 55 . ■ . * . . , ^ * ., ^ 

produce said normalized output signal adjusting a level of intensity of each of said channel 

29. A signal transmission method for producing a com- oujutsasa fonctionof ^said one said channel outputs 
posite output signal by linking signals from a plurality of sudl mat ^ d levels of intensity of said channel outputs 
signal sources cwnprising: m nonnallzcd - 

. . , go 38. The system of claim 37 wherein said step of forming 

generating a series^ oil first signa^ segments and cue tones a modu|ati said si ^ nls 

indicating the borders of said first signal segments; onto a ^^signaL and said deterrrnningTtep includes 

generating a series of second signal segments in response detecting and demodulating a preselected one of said chan- 

to said cue tones; nel outputs for selectively determining loudness for each of 

forming said composite output signal by alternately link- $5 said channel outputs. 

ing said first signal segments with said second signal 

segments; * # * * * 
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